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Abstract: The polyproteins of coronaviruses are cleaved by viral proteases into at least 15 
nonstructural proteins (Nsps). Consisting of five domains, Nsp3 is the largest of these (180-210 
kDa). Among these domains, the so-called X-domain is believed to act as ADP-ribose-1"- 
phosphate phosphatase or to bind poly(ADP-ribose). However, here we show that the X-domain of 
Infectious Bronchitis Virus (strain Beaudette), a Group-3 coronavirus, fails to bind ADP-ribose. This 
is explained on the basis of the crystal structure of the protein, determined at two different pH 
values. For comparison, we also describe the crystal structure of the homologous X-domain from 
Human Coronavirus 229E, a Group-1 coronavirus, which does bind ADP-ribose. 

Keywords: X-domain; macrodomain; nonstructural protein 3; coronavirus; SARS; ADP-ribose; X-ray 
structure; ADP-ribose-1 "-phosphate phosphatase 


Introduction 

Coronaviruses are positive-stranded RNA viruses with 
a genome size of 27-32 kb. Their enveloped, spherical 
virions have a diameter of 100-160 nm and carry 
prominent spike proteins that give the surface of the 
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particle a crown-like appearance in the electron micro¬ 
scope, hence the name coronaviruses. The genera Toro- 
virus and Coronavirus are members of the family 
Coronaviridae. This family belongs to the order Nido- 
virales, together with the Arterivirdae and Roniviridae 
families. Nidoviruses have similar genome organization 
and replication strategies but they differ in morphology, 
genome size, and several other biological properties. 1 

Coronaviruses have been subdivided into three 
main groups based on serological and genetic properties. 
Group 1 includes the human coronaviruses 229E (HCoV 
229E) and NL63 (HCoV NL63), the porcine Transmissi¬ 
ble Gastroenteritis Vims (TGEV), and feline coronavirus 
(FCoV). Although a Group-3 coronavirus was recently 
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isolated from a Beluga whale, 2 members of this sub¬ 
group generally infect birds, for example, the Infectious 
Bronchitis Virus (IBV) which can cause serious eco¬ 
nomic loss in the chicken industry. 3 The human corona- 
viruses OC43 (HCoV OC43) and HKUl (HCoV HKUl), 
Mouse Hepatitis Virus (MHV), and bovine coronavirus 
(BCoV) belong to Group 2. The 2003 outbreak of severe 
acute respiratory syndrome (SARS), caused by a corona- 
virus that was classified as an outlier in Group 2, 4 
renewed the interest in the coronavirus family. 

HCoV 229E and HCoV OC43 are responsible for 
10-30% of all common colds. HCoV NL63 causes re¬ 
spiratory tract illness in children below the age of 1 
year and in immunocompromised adults. 5 

The coronavirus genome contains several open 
reading frames (ORFs). The two overlapping ORFs la 
and lb at the 5' end comprise two thirds of the ge¬ 
nome. ORFia encodes the polyprotein la (ppia) and 
ORFia and lb together code for ppiab. Synthesis of 
the latter depends on a (—1) ribosomal frameshift dur¬ 
ing translation, which occurs just upstream of the 
ORFia stop codon. In most coronaviruses, the poly¬ 
proteins are processed by two papain-like proteases 
(PLi pro and PL2 pro ) and the main proteinase (M pro ), 
all of which are encoded within ORFia. The resulting 
15 (IBV) or 16 nonstructural proteins (Nsps) are asso¬ 
ciated with genomic and/or subgenomic RNA synthe¬ 
sis (replication and transcription, respectively). Some 
of them may also be involved in interactions with 
host-cell components. 

Several of the Nsps of SARS-CoV have had their 
three-dimensional structures determined within struc¬ 
tural proteomics programs (see Refs. 6 and 7 for 
reviews). However, structural information on other co¬ 
ronavirus Nsps is limited: Of HCoV 229E, the only 
structures known are those of the main proteinase 
(Nsps; Ref. 8) and of Nsp9- 9 Among the IBV Nsps, 
only the structure of the main proteinase 10 is known 
so far. As we will show here for the X-domain of 
Nsp3, complementary structural information on other 
coronaviruses than SARS-CoV can make a major con¬ 
tribution to understand the function of the Nsps of the 
entire family. 

With a molecular mass of 180-210 kDa, the 
multidomain Nsp3 is the largest cleavage product 
of ppia and ppiab. Its domain organization is more 
or less conserved among the coronaviruses. Several 
highly hydrophobic stretches in the C-terminal Y 
domain anchor Nsp3 to intracellular membranes. 11-13 
Immuno-EM studies revealed that most of Nsp3 local¬ 
izes to convoluted membranes, which are part of a 
reticulovesicular network of modified endoplasmic 
reticulum that is thought to support the viral replica¬ 
tion/transcription complexes. 14 

The N-terminal domain of Nsp3 comprises 150- 
240 amino-acid residues and is called acidic domain 
in view of the high number of Asp and Glu residues in 
its carboxy-terminal third. The first papain-like prote¬ 


ase (PLi pr °) is located downstream of the acidic do¬ 
main. Along with the PL2 pro , which is further down¬ 
stream, the PLi pro is responsible for the processing of 
three (IBV: two) cleavage sites containing LXGG'X 
consensus sequences in the N-terminal third of the 
coronaviral polyproteins. Interestingly, the PLi pro is 
missing in SARS-CoV, 4 and in IBV, it is catalytically 
inactive because of amino-acid replacements in the 
catalytic center. 13 

C-terminal to the PLi pro of most coronaviruses, 
there is the so-called X-domain, which was first 
described by Gorbalenya et al . 15 and was later proposed 
to be an ADP-ribose-i"-phosphate phosphatase (Appr- 
i"-pase). 4 Appr-i"-pases are involved in tRNA splicing 
by removing the phosphate group of adenosine diphos¬ 
phate ribose i"-phosphate, thus producing adenosine 
diphosphate ribose (ADP-ribose). 16 Appr-i"-pase activ¬ 
ity has been demonstrated in vitro for the isolated X- 
domains of HCoV 229E 17 and SARS-C0V. 18 ’ 19 The 
structure of the SARS-CoV Nsp3 X-domain (SARS-X) 
has been determined by Saikatendu et al ., 18 and, in 
complex with ADP-ribose, by Egloff et al . 19 

The fold of the SARS-X was found to be similar to 
that of the so-called macrodomain. This name goes 
back to the noncanonical domain in the histone mac- 
roH2A. 2 ° The macrodomain fold has also been found 
in X-ray structures of proteins from the thermophilic 
organism Archaeoglobus fulgidus (AF1521 protein; 
Refs. 21 and 22), yeast (Ymx7 protein; Ref. 23), and 
E. coli (Er58 protein). 

Here, we describe the crystal structures of the 
X-domains of the coronavirus Group-i member HCoV 
229E (229E-X) and the Group-3 coronavirus IBV 
(strain Beaudette; IBV-X), and compare them with the 
structure of the homologous SARS-X. We also investi¬ 
gate the binding properties for ADP-ribose and show 
that 229E-X binds this product of Appr-i"-pase activ¬ 
ity, whereas IBV-X (strain Beaudette) fails to do so. 

Results 

ADP-ribose binding studies 

Using both isothermal titration calorimetry (ITC; Fig. 
1) and zone-interference gel electrophoresis (ZIGE; 
results not shown), recombinant 229E-X was found to 
bind ADP-ribose with a K a of 28.9 ± 2.6 | xM. This 
compares well to the value of 24 pM determined for 
SARS-X. 19 In contrast, IBV-X (strain Beaudette) failed 
to show any binding of ADP-ribose using either tech¬ 
nique (data not shown). 

Crystallization and structure determination 

We determined the crystal structure of IBV-X (strain 
Beaudette) at two pH values, 5.6 and 8.5. The crystals 
were grown under different conditions, using ammo¬ 
nium sulfate and polyethylene glycol (PEG) 4000, 
respectively, as precipitants (see Table I). The low-pH 
crystals displayed space group P3 2 2, with a = 78.2 A 
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Figure 1 . Isothermal titration calorimetry profile for the 
binding of ADP-ribose to HCoV-229E X-domain. 

and c = 81.7 A, whereas the high-pH crystals belonged 
to space group 0222^ with a = 42.2 A, b = 79.8 A, 
and c = 99.7 A. The crystals diffracted X-rays to Bragg 
spacings of 2.1 A and 1.6 A, respectively. Both crystal 
forms are different from the IBV-X crystals reported 
very recently by Wei et al , 24 In what follows, our low- 
pH and high-pH forms of the IBV-X will be abbrevi¬ 
ated by IBV-X5.6 and IBV-X8.5, respectively. 

In addition, we crystallized 229E-X, in order to 
compare the structures of X-domains from a Group-1 
(HCoV 229E) and a Group-3 (IBV, strain Beaudette) 
coronavirus, along with the known structure 18,19 of the 
corresponding domain of SARS coronavirus, which 
belongs to Group 2b. 25 The 229E-X crystals were 
grown from 20% PEG 8000, in the presence of 5% 
2-methyl-2,4-pentanediol (MPD) at pH 8.5; they dis¬ 
played space group f > 2 1 with a = 33.6 A, b = 65.9 A, c 
= 38.0 A, and (1 = 110.t°. The diffraction limit of these 
crystals was at 1.8 A resolution. 

All structures were determined by molecular 
replacement. In case of 229E-X, our structure of the 
X-domain of HCoV NL63 (NL63-X; Piotrowski et al., 
in preparation) was used as a search model, whereas 
solving the structures of both forms of IBV-X required 
the construction of a combined search model that 
made use of fragments of SARS-X 18 as well as of eu¬ 


karyotic macrodomains (see Materials & Methods). In 
view of the relatively low sequence identity between 
IBV-X and the other coronaviral X-domains (19% to 
SARS-X, 22% to 229E-X, 25% to NL63-X), and the 
structural deviations that we will describe later, it is 
no surprise, in retrospect, that straightforward molecu¬ 
lar replacement did not work in this case. 

Quality of the structural models 

All three X-domain structures described here are well 
defined, with crystallographic R factors ranging from 
0.16 to 0.20 CRfree values from 0.20 to 0.23) (see 
Table I). The resolutions of the structures reach 1.6 A for 
IBV-X8.5,1.8 A for 229E-X, and 2.1 A for IBV-X5.6. 

The HC0V-229E X-domain consists of 165 amino- 
acid residues and its gene construct coding carried an 
additional 28 residues (including a His6 sequence and 
a Tobacco Etch Vims (TEV) protease cleavage site) at 
the N-terminus. Because cleavage of the purification 
tag led to precipitation of 229E-X, uncleaved protein 
was used for crystallization. None of the extra residues 
have visible electron density, whereas all 165 residues 
of the authentic X-domain could be observed in the 
maps. Alternative conformations were detected for Asp 
10, Lys 56, Asp too, and Cys 157. 

In case of IBV-X, removal of the N-terminal puri¬ 
fication tag yielded soluble protein. As a result, four 
extra amino-acid residues (GAMA) were still present 
at the N-terminus of the X-domain. The X-domain of 
IBV (strain Beaudette) consists of 172 amino-acid resi¬ 
dues. When crystallization was performed at pH 8.5 
(IBV-X8.5), all amino-acid residues could be modeled 
except _ 3 GAMAPATCE 5 and Lys 172, for which electron 
density was lacking. Alternative conformations in this 
structure could be detected for Lys 51, His 72, Met 138, 
Glu 148, Thr 151, and Ser 160. Lys 31 has unusual (p/r|r 
angles. Located in a two-residue turn between helix at 
and strand (S3 (loop L4), this residue has well-defined 
electron density; its side-chain interacts with Asp 28. 

In the IBV X-domain structure derived from crys¬ 
tals grown at pH 5.6 (IBV-X5.6), all amino-acid resi¬ 
dues except .gGAMAPj could be modeled, although the 
density was well defined only from residue 6 onward. 
For Asp 135, an alternative conformation was refined. 
For crystallization at pH 5.6, 200 m M K-,Na-tartrate 
was used. During refinement of the structure, differ¬ 
ence density for two tartrate (TLA) molecules emerged. 
One molecule is located at loop L10 and is hydrogen- 
bonded to the backbone NH of Asp 117 and the side- 
chain of Arg 77. This tartrate is also involved in a crys¬ 
tal contact with Ser 47 and Gly 48 of a neighboring 
molecule. The second tartrate molecule is located at 
L5 and the N-terminus of cx2, where it interacts with 
the backbone amides of His 45, Gly 46, and Ser 47. In 
spite of the involvement of tartrate in crystal contacts, 
isomorphous crystals could also be grown in the ab¬ 
sence of this additive, but diffracted only to a Bragg 
spacing of 2.4 A. 
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Table I. Crystallization, Data Collection, and Refinement 



HC0V-229E X-domain 

IBV X-domain 

IBV X-domain 

Crystallization conditions 

Data collection 

20% PEG 8000 

0.1M Tris 
pH 8.5 

5% MPD 

30% PEG 4000 

O.lM Tris 
pH 8.5 

0.2M MgCL, 

1.8M (NH 4 ) 2 S 0 4 
o.lM Na-citrate pH 5.6 
0.2M K-,Na-tartrate 

Wavelength (A) 

1.04 

0.808 

0.808 

Resolution (A) 

24.22-1.78 (1.88-1.78) 

40.00-1.60 (1.64-1.60) 

31.28-2.10 (2.21-2.10) 

Space group 

Unit-cell parameters 

P2, 

C222 x 

P 3 a 2 

a (A) 

33-56 

42.21 

78.20 

b (A) 

65.89 

79.81 

78.20 

c (A) 

38.02 

99-74 

81.70 

a (°) 

90 

90 

90 

P(°) 

110.1 

90 

90 

Y (°) 

90 

90 

120 

Solvent content (%, v/v) 

33-73 

43.12 

66.88 

Overall reflections 

101,730 

158,270 

503,497 

Unique reflections 

14,479 ( 1946 ) 

22,201 (1448) 

17,123 (2231) 

Multiplicity 

4-2 ( 3 - 8 ) 

7.1 (6.2) 

11.5 (10.4) 

Completeness (%) 

96.9 (89.9) 

97-9 ( 97 -i) 

98.4 (89.4) 

Emerge" (%) 

5-2 ( 15 - 5 ) 

10.3 (24.4) 

7.9 (40.1) 

IMD 

Refinement 

19.6 (7.8) 

21.7 (6.8) 

25-6 (6.5) 

Resolution (A) 

24.22-1.78 

40.00-1.60 

31.28-2.10 

P b 

rv cryst 

0.165 

0.169 

0-199 

Rfree' 

r.m.s.d. from ideal geometry 

0.205 

0.203 

0.231 

Bonds (A) 

0.013 

0.011 

0.022 

Angles (°) 

1.284 

1-349 

1.896 

Protein atoms 

1283 

1325 

1335 

Solvent atoms 

Heteroatoms 

Ramachandran plot 

146 

212 

75 

20 

Most favored (%) 

91.7 

92.5 

92.1 

Additionally allowed (%) 

6-9 

6.8 

6.6 

Generously allowed (%) 

0.7 

0.0 

0.7 

Disallowed regions (%) 

0.7 

0.7 

0.7 


Values in parentheses are for the highest resolution shell. 

a Emerge = J2hkl J2i |J r (/lfcZ) i - { I(hkl )) |/ J2hkl X],V(frW),-, where I(hkl) is the intensity of reflection hkl and { I{hkl )) is the average 
intensity over all equivalent reflections. 

b Rcryst = flhki I F 0 (hkl) - F c (hkl)\/Ylhki Fo(hkl). .Rf ree was calculated for a test set of reflections (6%, 5%, and 6%, respectively) 
omitted from the refinement. 


In the final Fourier maps for IBV-X5.6, three 
regions of density could not be interpreted. The first 
of these is located on the twofold axis of symmetry, 
near the carboxy-terminal Lys 172 and the Phe 7-Tyr 
12 segment, as well as Arg 144 and Phe 167. It could 
represent two sets of twofold disordered glycerol mole¬ 
cules, as glycerol was used as a cryoprotectant, but 
this model was not stable during refinement. The sec¬ 
ond unexplained density is surrounded by Leu 159-Glu 
162, as well as Pro 81-Gln 88 of a symmetry-related 
molecule. The third noninterpreted density is located 
in the interior of the domain, close to His 45. It could 
represent a second conformation of this histidine, but 
this model could not be refined either. 

Overall structures 

The overall structures of 229E-X and IBV-X are simi¬ 
lar to one another, even though the sequence identity 


between the two is only 22% (see Fig. 2). They are 
also similar to the known structure of SARS-X (34% 
and 19%, respectively). The structures show the so- 
called macrodomain fold, consisting of a central, 
mostly parallel p-sheet flanked by three a-helices on 
one side and three on the other (see Fig. 3). Along the 
polypeptide chain, the order of regular secondary 
structure elements is pi-p2-ai-P3-a2-oi3-P4-P5-oi4-p6- 
ot5-P7-a6. The p-sheet topology is P1-P2-P7-P6-P3-P5- 
P4, with the external strands pi and P4 being antipar¬ 
allel to their neighbors. Interestingly, pi is missing in 
IBV-X. As a result, the positions of the N-terminal res¬ 
idues are very different (15.0 A between Leu 3 of 
229E-X and the corresponding Ala 2 of IBV-X5.6) in 
the two structures. 

The overall r.m.s. deviation between the X- 
domains of HCoV 229E and IBV (strain Beaudette) is 
1.81 A and 1.72 A for IBV-X5.6 and IBV-X8.5, 
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HCoV 22 9E 

SARS-CoV 

IBV 



•* * ••*••• * •••• * •• 


1 EKLNAFLVHDNVAFYQGD-VDTWNGVD-FDF IVNAANEN LAHGGGLAKALDVY TK 

1 PVNQFTGY LKLT DNVAI KCVDIVKEAQSAN- PMVIVNAANIHLKHGGGVAGALNKATN 

1-PATCEKPKFLEYKTCVGDLTWIAKALDEFKEFCIVNAANEHMTHGSGVAKAIADFCG 


u3 



u 4 


HCoV 22 9E 

SARS-CoV 

IBV 


55 GKLQRLSKEHIGLAGKVKVGTGVMVECDS-L — RIFNWGPR KGKHERDLLIKAYNTINN-E 

58 GAMQKESDDYIKLNGPLTVGGSCLLSGHN-LAKKCLHVVGPNLNAG-EDI-QLLKAAYENFN —S 

59 LDFVEYCEDYVKKHGPQQ - RLVTPS FVKG IQCVNNWGPR — HGDNNLHEKLVAAYKNVLVDG 


* * * ** * * * * * * *** 



P 6 a5 P 7 ci6 



* * ** **•* * * * 


HCoV 229E 113 QGTPLTPILSCGIFGIKLETSLEVLLDVC-NTKE-VKVFVYTDTEVCKVKDFVSG- 165 

SARS-CoV 118 QDILLAPLLSAGIFGAKPLQSLQVCVQTV—RTQ-VYIAVNDKALYEQWMDYLDNLKPRV 175 
IBV 119 WNYWFVLSLGIFGVDFKMSIDAMREAFEGCTIRVLLFSLSQEHIDYFDVTCK - 172 



Figure 2. Structure-based sequence alignment of the X-domains of HCoV 229E and IBV (strain Beaudette) with the 
homologue in SARS-CoV. Secondary structure elements of the HCoV-229E X-domain and the IBV X-domain are represented 
above and below the alignment, respectively. Amino acid residues labeled in cyan have been included in the calculation of 
the r.m.s.d. value. Asterisks above and below the alignment indicate aligned and conserved amino acid residues. 


respectively, for 146 Cot pairs out of 165 compared. 
The low-pH and high-pH forms of the IBV X-domain 
display a low overall r.m.s.d. of 0.46 A, the only signif¬ 
icant difference being caused by crystal contacts in 
loop region L11 connecting (16 and 015, with deviations 
of up to 11 A at Phe 132 and Gly 133. This region is 
involved in an extensive crystal contact in the low-pH 
but not the high-pH form. In 229E-X, this loop makes 
only weak, if any, crystal contacts and adopts a confor¬ 
mation similar to that seen in SARS-X 19 (see Fig. 3). 

The putative ligand-binding site 

Coronaviral X-domains have been proposed to bind 
ADP-ribose, either as product of an ADP-ribose-i"- 
phosphate phosphatase (Appr-i"-pase) activity, 17 or as 
part of poly(ADP-ribose) binding. 19 As described ear¬ 
lier, we found by ZIGE and ITC that 229E-X binds 
ADP-ribose, but IBV-X (strain Beaudette) fails to do 
so. To rationalize this observation on the basis of our 
crystal structures, we will inspect the putative binding 
sites for ADP-ribose in the two proteins in detail and 
compare them with the crystal structure of SARS-X in 
complex with ADP-ribose. 19 

In the latter structure, the adenine of ADP-ribose 
is recognized by a 2.9 A hydrogen bond from the exocy- 
clic NH 2 group of the base to the carboxylate of Asp 23 
[Fig. 4(A)]. This aspartate residue is highly conserved 
in coronavirus X-domains, including HCoV 229E and 
IBV. In the structures of IBV-X and 229E-X, both of 
which lack a bound ADP-ribose, this aspartate is ori¬ 
ented away from the binding site. In SARS-X, the ade¬ 
nine ring of ADP-ribose is sandwiched between the 
side-chains of Asn 157 [L13 ([17-76) loop], Val 50 [L5 


(|i 3 - 72 ) loop], and lie 24 [see Fig. 4(A)]. In IBV-X 
(strain Beaudette), Asn 157 is replaced by Leu 159, 
whereas Val 50 is conserved (Val 49), and lie 24 
becomes Leu 19. In 229E-X, Asn 157 is replaced by Tyr 
151, the side-chain of which is oriented along the puta¬ 
tive binding site for ADP-ribose but could easily reor¬ 
ient to interact with the adenine ring when present. Val 
50 is replaced by Leu 45, and He 24 by Val 19. 

In 229E-X, Glu 155 could interact with the 
hydroxy groups of the adenosine ribose when present 
(as it does in HC0V-NL63 X-domain; Piotrowski et al., 



Figure 3. Ribbon-representation of the superimposed X- 
domains of HCoV 229E (blue) and IBV (strain Beaudette) 
(green, pH 8.5; red, pH 5.6). N- and C-termini are indicated; 
the symbol “N” is framed for IBV-X. 
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Figure 4. A: Stereo representation showing the binding of 
ADP-ribose to the X-domain of SARS-CoV , 19 compared 
with the equivalent site in IBV-X8.5. The ADP-ribose 
molecule and the amino acid residues involved in the 
binding are represented as sticks. Carbon atoms of the 
ADP-ribose are colored white, carbon atoms of the protein 
are colored yellow and green, for SARS-X and IBV-X, 
respectively. Nitrogens and oxygens are colored blue and 
red, respectively. B: Residues 42 GGG 44 and 12 2 SCGIFG 12 7 
(L11) of 229E-X (sticks, blue carbon atoms), likely involved 
in binding ADP-ribose, superimposed onto the 
corresponding site ( 46 GSG 4 8 and 12 8 SLGIFGi 3 3 ) in IBV-X8.5 
(transparent surface colored according to electrostatic 
potential, green sticks). Residue labels for 229E-X are 
framed. Note that lie 131, and with it, the entire loop L11, in 
IBV-X is “pushed away” (red arrows) due to steric 
hindrance from Ser 47, thereby destroying the binding site 
for ADP-ribose. 

in preparation), but in SARS-X, this glutamate 
becomes Leu 161 and in IBV-X (strain Beaudette), His 
163, both of which are not capable of interacting with 
the proximal ribose. 

Further along the binding site for ADP-ribose in 
SARS-X, the cleft is lined by three consecutive gly¬ 
cines [nos. 47, 48, and 49; at the C-terminus of the 
P3-012 (L5) loop] on one side and the 12g SAGIFG 134 


segment on the other [Fig. 4(A)]. The amide of Gly 
48 makes a 3.0 A hydrogen bond to the Oi" of the 
distal ribose and Phe 133 packs against this sugar on 
its opposite face. The backbone amide groups of Ser 
129, Gly 131, lie 132, and Phe 133 donate hydrogen 
bonds to the phosphate oxygens. Most importantly, 
the side-chain of lie 132 is in close contact (3.5 A) 
with Gly 48, thereby forming a “bridge” over the 
binding cleft. 19 Even in the absence of ADP-ribose, 
this close interaction exists in SARS-X (4.1 A; 
Ref. 18). The “triple glycine” sequence is conserved in 
most coronavirus X-domains, with a few exceptions, 
including IBV (strain Beaudette, but not all IBV 
strains) and MHV, which have the second glycine 
replaced by Ser and Ala, respectively, and TGEV and 
FCoV, which have Met and Val, respectively, instead 
of the first glycine. The replacement of the “central 
glycine” by serine is correlated with dramatic struc¬ 
tural differences between the X-domains of IBV 
(strain Beaudette) and SARS-CoV: as the close con¬ 
tact to the conserved isoleucine (no. 131 in IBV-X) is 
no longer possible, the bridge across the binding cleft 
collapses, and the entire fS6-a5 (Lit) loop carrying the 
12 8SLGIFG 133 segment is pushed away (see Fig. 4). As 
a consequence, the Ca atom of He 131 in the high-pH 
form of IBV-X is located 5.9 A away from the position 
of the corresponding He 132 in SARS-X [Fig. 4(A)]. 
In the low-pH form, this distance is 4.4 A, but as 
described earlier, the (S6-a5 (Lit) loop is involved in a 
crystal contact in this structure. 

As a consequence of the conformational changes 
of the Lit loop in IBV-X8.5 when compared with 
SARS-X (and 229E-X), at least five hydrogen bonds to 
the phosphates and the distal ribose, as modeled into 
the structure, are lost in IBV-X (strain Beaudette). 
Furthermore, along with He 131, Phe 132 is displaced 
from the binding cleft in IBV-X [by '-6.5 A for its Ca, 
compared with SARS-X and 229E-X, see Fig. 4(B)]. 
Through a mutational study, this phenylalanine was 
identified as one of only two amino-acid residues 
essential for the Appr-i"-pase activity of SARS-X. 19 
(The other was Asn 41, which is present in both 229E- 
X and IBV-X, in the same orientation as in SARS-X). 
In view of the conformational changes described, it is 
no surprise that IBV-X (strain Beaudette) does not 
bind ADP-ribose to a significant extent. 

In 229E-X, He 125 and Gly 43 are positioned 
opposite to one another, similarly to the situation in 
SARS-X, but their distance is >6 A, so that no close 
contact exists [see Fig. 4(B)]. Yet, the hydrogen bonds 
between the main-chain amides of the 122 SCGIFG 127 
and the phosphates as well as the distal ribose of 
ADP-ribose can all be made, as demonstrated in the 
structure of the highly similar X-domain of human co¬ 
ronavirus NL63 with ADP-ribose (Piotrowski et ah, in 
preparation). In agreement with this, we were able to 
measure significant binding of ADP-ribose to 229E-X 
(see Fig. 1). 
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Discussion 

The crystal structures of the X-domains of HCoV 229E 
and IBV (strain Beaudette), a Group-i and a Group-3 
coronavirus, reveal high similarity to the correspond¬ 
ing domain of SARS-CoV, which belongs to Group 
2b. 25 They are also similar to the macrodomains of 
Archaeoglobus fulgidus (AF1521; Ref. 21), the nonca- 
nonical histone macroH2A, 26,27 and the yeast protein 
Ymx7- 23 In fact, the closest structural homologue to 
IBV-X is the macrodomain protein Er.58 of E. coli 
(r.m.s.d. 1.54 A; PDB ID lSPV). Like IBV-X, this pro¬ 
tein also lacks the first (1-strand (fii); however, it 
should be able to bind ADP-ribose, as its triple-Gly 
sequence is conserved (whereas in IBV-X strain Beau¬ 
dette, Gly 47 is replaced by serine). IBV-X shares 20% 
sequence identity with AF1521, 15% with the N-termi- 
nal domain of Ymx7, and 15% with Ers8. 

Many Nsps, including domains of Nsp3, have 
been shown to nonspecifically bind to RNA (see Ref. 7, 
for a review). This has also been discussed for the 
X-domain of SARS-CoV. However, while the latter is 
basic (calculated pi = 8.1), this is not true for the 
X-domains of HCoV 229E (pi = 6.1) and IBV (pi = 
5.7), making a general nucleic acid-binding function 
less likely. 

Both 229E-X and SARS-X, as well as the X-do¬ 
main of TGEV, another Group-i coronavirus, have 
been shown to display Appr-i"-pase activity. 17-19,28 
However, by mutating Asn 36, which is essential for 
the Appr-i"-pase function, it was demonstrated that 
this enzymatic activity is not important for the HCoV- 
229E viability in cell culture. 17 In eukaryotes, the 
Appr-i"-pase activity is involved in tRNA splicing. 29,30 
Likewise, the X-domain through its Appr-i"-pase activ¬ 
ity was implicated in the regulation of viral RNA syn¬ 
thesis. 4 In the tRNA metabolic pathway, the Appr-i"- 
pase works in concert with ADP-ribose i",2"-cyclic 
phosphodiesterase (CPD). A similar cooperation was 
envisioned for a fraction of coronaviruses which 
encode a CPD homologue. 4,31,32 However, the turnover 
constant, /c cat , measured for the Appr-i"-pase activity 
of isolated coronaviral X-domains, is unusually low 
(5-20 min -1 ; Ref. 28) for an enzyme involved in met¬ 
abolic regulation. Although the /c cat for the full-length 
Nsp3 remains unknown, this observation was used to 
question a direct role of the coronavirus X-domain in 
viral RNA synthesis.' 9 

If the X-domain does not participate in coronavi¬ 
rus RNA replication, its involvement in the regulation 
of virus-host interaction has to be considered. Eriks¬ 
son et al . 33 have shown that mice infected with mutant 
MHV, which carried the aforementioned Asn 41- Ala 
mutation (SARS-X numbering) in the X-domain, 
exhibited reduced cytokine production, decreased liver 
pathogenesis, and increased interferon-a production. 
We also note that other domains of Nsp3 are involved 
in interaction with host-cell components. The PL pro of 
SARS-CoV has been shown to block type-I interferon 


production in the host. 34 The SARS-unique domain 
(SUD) of SARS-CoV Nsp3 has been proposed to coun¬ 
teract host-cell apoptosis at the translational level by 
binding to oligo(G) stretches in the 3'UTR of the 
mRNA of the apoptosis-related protein Bbc3, or to 
interrupt type-I interferon production by binding to 
similar regions of mRNA coding for signaling proteins 
in the MAP-kinase cascade. 35 For SARS-X, Egloff 
et al . 19 have demonstrated poly(ADP-ribose)-binding 
properties, suggesting that the domain may play a role 
in signaling pathways involving this polymer. 

Poly(ADP-ribosylation) is an important posttrans- 
lational modification catalyzed by the enzyme family 
of the poly(ADP-ribose) polymerases (PARPs). The 
prototype of the family, PARP-i, is involved in the rec¬ 
ognition of DNA damage. Binding to DNA single¬ 
strand and double-strand breaks stimulates PARP-i to 
catalyze the synthesis of linear and branched polymers 
of ADP-ribose from the substrate nicotinamide-ade- 
nine-dinucleotide (NAD + ), with concomitant release of 
nicotinamide. 36 PARP-i is also a major regulator of 
apoptosis; this is, at least in part, because of its func¬ 
tion as a coactivator of NF-KB-dependent gene expres¬ 
sion. 37,38 PARPs are primarily localized in the nucleus, 
but certain members of the family appear to act in the 
cytoplasm, at least partly. 39 

Several viruses have evolved mechanisms to 
counteract the apoptotic activity of PARPs. The viral 
protein R (Vpr) of HIV-i has been reported to form a 
ternary complex with the glucocorticoid receptor and 
PARP-i, thereby preventing translocation of the latter 
to the nucleus. 40 Poliovirus, SARS coronavirus, and 
Infectious Bronchitis Virus have been shown to 
induce proteolytic cleavage of PARP-i. 41-43 We specu¬ 
late that the viral X-domains could also have an 
adverse affect on the activity of cytoplasmic PARPs, 
either by binding to the poly(ADP-ribose) chains that 
PARPs not only attach to target proteins but also to 
themselves, or by direct interaction. Such a direct 
interaction between the parent macrodomain, that of 
histone macroH2A, and nuclear PARP-i has recently 
been shown to decrease the catalytic activity of the 
enzyme. 44,45 

Our data show that the IBV-X (strain Beaudette) 
fails to bind ADP-ribose. This observation is explained 
by the three-dimensional structure of the protein. The 
replacement by serine of the central glycine (Gly 47) of 
the triple-Gly sequence that forms part of the ADP- 
ribose binding site, pushes away the opposite lie 131 
and leads to a major conformational change of the 
loop (L11) carrying this residue (see Fig. 4), resulting 
in the loss of as many as five hydrogen bonds and one 
hydrophobic interaction with ADP-ribose. Therefore, 
we conclude that not all coronavirus X-domains bind 
ADP-ribose or poly(ADP-ribose) and suggest that 
either the X-domain in IBV (strain Beaudette) is a re¬ 
lict of evolution, or it has another function remaining 
to be characterized. 
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Materials and Methods 

Domain prediction, cloning, expression, 
and purification 

The X-domains of HCoV 229E and IBV (correspond¬ 
ing to ppia/ppiab residues 1270-1434 and 1005-1176, 
respectively) were predicted in Nsp3 using pan-coro- 
navirus multiple sequence alignment generated with 
support from the Viralis software platform (Gorbale- 
nya et ai, unpublished) and previously characterized 
X-domains. 17-19 Domains were PCR-amplified from 
viral RNA or cDNA and cloned into the pET-Mn vec¬ 
tor and expressed with an N-terminal extension 
(MKHHHHHHPMSDYDIPTTENLYFQGAMA), includ¬ 
ing a His 6 tag for purification and a TEV protease 
cleavage site (ENLYFQ^G) for removal of the tag. The 
two proteins were produced and purified in similar 
ways. The respective plasmid was transformed into the 
competent E. coli BL21 (DE3) strain. Cultivation in lx 
TY medium was performed at 37°C. Gene expression 
was induced by addition of 0.5 m M IPTG when cells 
had reached an optical density of 0.5 at 660 nm. 
Expression was carried out at 20°C for 5 h. Cells were 
harvested by centrifugation for 15 min at 72009 and 
4°C. The cell pellets were resuspended in 50 mM Tris, 
300 m M NaCl, 20 mM imidazole, 10% glycerol, pH 
8.0, for 229E-X and 50 m M Tris, 300 m M NaCl, 20 
m M imidazole, pH 8.0, for IBV-X, each including 10 
mg lysozyme and one Complete EDTA-free protease 
inhibitor cocktail tablet (Roche Diagnostics, Mann¬ 
heim, Germany). Cells were broken by French press. 
After ultracentrifugation (150,0009, 1 h, 4°C) of the 
sample, the supernatant was applied to a His-Trap FF 
5-mL column (GE Healthcare, Freiburg, Germany) 
with a flow rate of 1 mL/min. After washing with the 
buffer mentioned earlier, the protein was eluted with a 
linear gradient of 20-500 m M imidazole. For IBV-X, 
the buffer was changed to cleavage buffer (50 m M 
Tris, 0.5 m M EDTA, 1 m M dithiothreitol (DTT), pH 
8.0) by size exclusion on a Superdex 75 column (GE 
Healthcare, Freiburg, Germany). TEV protease and 
X-domain carrying an N-terminal His 6 tag were com¬ 
bined to a molar ratio of 1 : 40 (protease: tagged IBV- 
X) and the sample was incubated for 16 h at 20°C. To 
separate tag-free IBV-X from the His 6 tag and the TEV 
protease, the protein solution was applied to a His- 
Trap FF 5-mL column (GE Healthcare, Freiburg, Ger¬ 
many) with a flow rate of 1 mL/min. In case of 229E- 
X, the His 6 -tag-free protein tended to precipitate after 
incubation with the TEV protease at 20°C and even at 
4°C. Therefore, purification of this X-domain was car¬ 
ried out without removal of the His 6 tag and the pro¬ 
tein was applied to the Superdex 75 column (GE 
Healthcare, Freiburg, Germany) for size exclusion and 
buffer exchange (10 m M Tris, 100 m M NaCl, 1 m M 
DTT, pH 7.5) directly after Ni 2+ -NTA affinity chroma¬ 
tography. IBV-X was treated in the same way after the 
cleavage. For further details, see Piotrowski et al . 46 


Crystallization, structure determination, 
and refinement 

Crystallization screening was performed using a Phoe¬ 
nix robot (Dunn Labortechnik, Thelenberg, Germany) 
employing the sitting-drop vapor-diffusion technique. 
Drops containing 0.26 pL protein solution and 0.28 
pL reservoir solution were equilibrated against 85 pL 
reservoir solution. Initial hits were optimized the same 
way. 

IBV X-domain. Protein crystals appeared within 1 
day at 12°C and a protein concentration of 11-13 mg/ 
mL. Two different crystal forms could be observed 
when PEG 4000 or 8000 (at pH 7.5 or 8.5) and am¬ 
monium sulfate (at pH 5.6) were used as precipitants, 
respectively. When PEG was used as precipitant, crys¬ 
tal growth required the presence of 0.2 M Mg 2+ , a PEG 
concentration of 20-30%, and pH 7.5 or 8.5. When 
ammonium sulfate was used instead, crystal growth 
was restricted to a narrow pH range around 5.6. The 
quality of the latter crystals improved upon addition of 
200 mM K-,Na-tartrate to the crystallization experi¬ 
ment (see Table I for details). For diffraction data col¬ 
lection of crystals grown from PEG 4000, no addi¬ 
tional cryoprotectant was needed, whereas 30% 
glycerol had to be added as cryoprotectant when 
(NH 4 ) 2 S 0 4 was used as precipitant. 

HC0V-229E X-domain. For crystallization, a pro¬ 
tein concentration of 10-13 mg/mL was used. Protein 
crystals appeared within 2 days at 12°C when PEG 
400, 4000, 6000, 8000, or 10,000 in a concentration 
of 10-30% was used as precipitant at pH 7.5 to 8.5 
(see Table I for details). The cryoprotectant was 30% 
PEG 400. 

Collection and processing of diffraction 
data. X-ray diffraction data from crystals of IBV-X 
were collected using synchrotron radiation of X = 
0.808 A at beamline X13 (University of Hamburg - 
University of Liibeck - EMBL) at DESY (Deutsches 
Elektronen-Synchrotron), Hamburg, Germany. Data 
collection from crystals of 229E-X was carried out 
using synchrotron radiation (>. = 1.04 A) at beamline 
I911, MAX-lab, Lund, Sweden. Diffraction data for 
IBV-X8.5 were processed, reduced, and scaled with 
DENZO/SCALEPACK (HKL Suite; Ref. 47). Data for 
229E-X and IBV-X5.6 were processed with 
MOSFLM 48 and scaled with the programm SCALA 
from the CCP4 suite. 49 

Structure determination and refinement. The 

structures were determined by molecular replacement 
using the CCP4 program Phaser. 50-52 For HCoV 229E, 
the X-domain of HCoV NL63 (Piotrowski et al., in 
preparation; PDB ID 2VRI) was used as a search 
model. In case of IBV-X5.6, molecular replacement 
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was not straightforward and required the usage of 
eight superimposed coordinate sets. This ensemble 
included structures of similar fold as predicted by 
FFAS03 (Ref. 53; PDB codes: 1YD9, 1ZR3, 2DX6, 
iSPV, 1ZR5, 2ACF) and previously solved structures of 
the X-domain from HCoV NL63 (Piotrowski et al., in 
preparation) and HCoV 229E (this communication). 
Truncation of flexible regions in the search model was 
crucial to obtain a clear molecular replacement solu¬ 
tion. To improve the relatively poor overall electron 
density of the initial model, protein backbone confor¬ 
mations were remodeled, ambiguous side-chains were 
changed to alanine, and two helices were adjusted 
manually. The model was completed using ARP/ 
wARP. 54,55 The structure of IBV-X8.5 was determined 
by molecular replacement using the structure of IBV- 
X5.6 as a search model. Models were built into 
electron density using COOT, 56 and refined by 
REFMAC5. 57 Structure superimposition and calcula¬ 
tion of r.m.s. deviations were carried out using 
ALIGN. 58 

Atomic coordinates have been deposited with the 
Protein Data Bank (PDB ID of 229E-X: 3EJG, IBV- 
X8.5: 3EJF, and IBV-X5.6: 3EKE). 

Zone-interference gel electrophoresis 

The ZIGE experiment was based on Abrahams et al. 6 ° 
and optimized for the conditions used here. A 1% aga¬ 
rose gel in lx TBE buffer (1.08 g Tris, 0.55 g boric 
acid, 0.093 g Na-EDTA in 100 mL Aqua dest., pH 8.3 
adjusted using acetic acid) was prepared with large 
slots for the ligand solution and small slots for the 
protein/ligand solution. 61 After polymerization, the 
large slots were filled with 100 pL of ADP-ribose solu¬ 
tion (100 llM or 1 mill) including 10% DMSO/bromo- 
phenol blue (BPB). The small slots were filled with 10 
pL of the protein/ADP-ribose solution with a protein 
concentration of 10 pM, an ADP-ribose concentration 
of 100 pM or 1 mill, and an additional 10% DMSO/ 
BPB. The electrophoresis was performed in 0.5x TBE 
buffer at 100 mA and 4°C for 1 h. Afterward, the gel 
was incubated in denaturing solution (3.5% a-sulfosa- 
lycylic acid, 10% trichloroacetic acid) for 10 min, fol¬ 
lowed by incubation in washing buffer (8% acetic acid, 
15% ethanol) for an additional 10 min. The gel was 
stained by Coomassie Brilliant Blue (0.25% Coomassie 
Brilliant Blue, 15% ethanol, 8% acetic acid, 10% meth¬ 
anol) for 20 min and destained. The figures were cre¬ 
ated using Pymol. 59 

Isothermal titration calorimetry 

Isothermal titration calorimetry was performed at 
25°C using a VP-ITC instrument (Microcal, Milton 
Keynes, United Kingdom). Reactions were carried out 
in 20 m M Tris, 100 m M NaCl, 10 mM MgCl 2 , 1 m M 
DTT, pH 7.5, for 229E-X and in TBE buffer, pH 8.3, 
for IBV-X, and successive injections of 1.5 m M ADP- 
ribose for 229E-X and 700 pM for IBV-X. The protein 


concentration was 33 p M for 229E-X and 50 pM for 
IBV-X. 
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Note added in proof 

After acceptance and typesetting of this publication, we 
became aware of a paper by Xu et al., e-published on 
Nov 05, 2008, by J. Virol, online ahead of print, that 
describes a crystal structure of the X-domain of IBV 
strain M41 in complex with ADP-ribose. In contrast to 
the X-domain of IBV strain Beaudette, which is the 
subject of the present paper, residue 47 is a glycine 
rather than a serine in the protein of IBV strain M41. 
Thus, the canonical triple-glycine sequence seen in 
other coronavirus X-domains is conserved and ADP- 
ribose binding is not prevented, in contrast to the X- 
domain of IBV strain Beaudette. The observations 
described in the Xu et al. publication thus support our 
conclusions. 
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